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SEQUENCE LISTING 



(1) GENERAL INFORMATION 
(i) APPLICANT: Chatterjee, Subroto 
(ii) TITLE OF THE INVENTION: RECOMBINANT N-SMASES AND NUCLEIC ACIDS 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dike, Bronstein, Roberts & Cushman, LLP 

(B) STREET: 13 0 Water Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Corless, Peter F 

(B) REGISTRATION NUMBER: 33,860 

(C) REFERENCE/DOCKET NUMBER: 46906 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-523-3400 

(B) TELEFAX: 617-523-6440 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 



ENCODING SAME 



(iii) NUMBER OF SEQUENCES: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1197 base pairs 

(B) TYPE: nucleic acid 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



ATGATGACAT 
GCACTTGAAA 
TTTGGCATTG 
TTTGGTGCGC 
TGGTTTGTCC 

accgtatctg 
accactagtc 
accttagcca 
ttatccgaca 
tatcatgtgg 
gtggctatgt 
Qacggatggcg 
cattgaacatc 

nJCTAAAGGTGC 
fflCCAATTACAG 
mCCAACACCAG 
mGAGAAGACAC 
SjGGGGAATTTG 

^Jttatttgctg 
."gggttgaaat 



ATCACGAAAC 
CAACTGAATT 
GCGCAATCGC 
AGTCATTTGA 
CCGAAGTGAT 
ATTTTACGAC 
ATGTGACAGA 
TCGATCAAAC 
CGTTACGACT 
TTTTAAAGCG 
CAGGTGGTCA 
CTGACGATAT 
AATATGTCGT 
CGGCCATGCC 
GGGACATTGC 
CACTGGGTGG 
CTCGTGGCTT 
TGGTTGGCAT 
GTGCCGGGAT 
TTGAACCCAT 



GCGCGCGTTG 
TGGCGCTTAC 
TACGGCAAAA 
TGAACAAGAG 
GGTGACCATC 
GTGGCTGGCG 
TGAAGTGGAT 
CTTAGCCAAA 
GGCACAAATT 
ACATGATGAA 
GATCGCTACG 
CGCGTTAGGC 
GGCAAGTATC 
AAGTTTACTC 
GGCACATTTA 
CGTCCCACGT 
GTTTGCAGGT 
CCGTTCCATG 
TGTGGCTGAC 
GCGGCAATTG 



GCTCAAAGCG 
TTTGCGACAC 
ACGGCTCAGG 
TACCCGCAGT 
GCGGCAGATA 
CAGTTCGTGC 
TGGATCGAAC 
GTCGTTTTTG 
ATTCGTGCGT 
TTGTTTATTT 
GCGGCGGTCG 
GAAGCGTTGT 
ACGACACGCT 
AAAAATAAGC 
AGTGTGACCG 
GAAGCGGCCC 
CCTATTGGCT 
TATGTGAATC 
TCCGATGCGC 
TTAAAGGACT 



J. ± r\\^.c\\J\~.J^ 


-L \ -L ±\ X V_ 


c n 
o u 


V-. VJ V— X \JT\ ± \Jr\. 




X u 


J. X Jrt.V— X"SJT.\JVJ 


X wV^VJVJ X X VJ X X 


1 fin 

X o u 


PTfiAATTnAT 

>w X wxnx^ X X X 


VJVJV^VJVJVJ X X X X 


^ 'x U 


A A ATP A^'P'T'T 




oUU 


CAAAACAGCC 


AAATACGGTG 


360 


GGACAGAGAA 


TTTGATTGAT 


420 


GTCGGCAACA 


GACCCTGCAG 


480 


TAGCTGAGCA 


GGCGAATACG 


540 


CAGCAACACC 


GGAACGGTTA 


600 


CTGGGACAAG 


CCGGCGCGGG 


660 


TAGCCAGTCA 


GAAAAACCGC 


720 


TGCAAGACGT 


GACGACGTCG 


780 


AAGTTCAGCA 


TTTGTACACA 


840 


CGATTGTTGA 


CCGCTTGCAT 


900 


TGTATTACAT 


TGCGACCCAT 


960 


ATTTTACCGC 


AGATAATAGT 


1020 


AAACGCAGCG 


ACGAGCAACT 


1080 


AACAAGAATA 


TGAAGAAACT 


1140 


ACAATCATGT 


CGAATGA 


1197 



^ (2) INFORMATION FOR SEQ ID NO: 2: 

J (i) SEQUENCE CHARACTERISTICS: 

f (A) LENGTH: 3 97 amino acids 

i (B) TYPE: amino acid 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE:" NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Met Thr Tyr His Glu Thr Arg Ala Leu Ala Gin Ser Asp Leu Gin 

15 10 15 

Gin Leu Tyr Ala Ala Leu Glu Thr Thr Glu Phe Gly Ala Tyr Phe Ala 

20 25 30 

Thr Pro Ala Asp Asp Thr Leu Arg Phe Gly lie Gly Ala lie Ala Thr 

35 40 45 

Ala Lys Thr Ala Gin Ala Leu Gin Gly Ala Val Phe Gly Ala Gin Ser 

50 55 60 

Phe Asp Glu Gin Glu Tyr Pro Gin Ser Glu Leu Met Ala Gly Phe Trp 
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65 

Phe Val Pro Glu 

Gly Ser Asp Thr 
100 

Pro Lys Gin Pro 
115 

Asp Trp lie Glu 
130 

Gin Thr Leu Ala 
145 

Ser Asp Thr Leu 

Ala Asn Thr Tyr 
180 

Ser Ala Thr Pro 
195 

Thr Ala Ala Val 

_ 210 

OAsp lie Ala Leu 
^225 

fUGlu His Gin Tyr 

ryThr Thr Ser Leu 
ffl 260 
SjGln Val Gin His 

"Leu Ser Val Thr 

U 290 

rlGly Gly Val Pro 

j^305 

^"^'Lys Thr Pro Arg 

'^Asp Asn Ser Gly 
^ 340 
Gin Thr Gin Arg 
355 

Asp Ser Asp Ala 
370 

Pro Met Arg Gin 
385 



70 



V dX 




Vd J. 


X iJ.X 


O 3 








V ex. J. 


OCX 




It lie 


Asn 


Thr 


Val 


Thr 








120 


7\ y*/-T 


J. liX 


ox Li 


i-ioXl 






X J D 




j-iy o 


Val 


Val 


Jtr Xitr 




150 








Leu 


Ala 


CZl n 

OXli 


X o o 










Val 


Val 


XlC Li. 


Glu 


Arg 


Leu 


Val 








200 


Ala 


Gly Thr 


OCX 






215 




Olv 

\j JL y 


Glu 


Ala 


T i^n 




230 






Val 

V d -L 


Val 


Ala 




OAR 










Val 


Pro 


rtXd 


Leu 


Tyr 


Thr 


Pro 








280 




He 


Val 


7\ d r~\ 






295 




/ix y 


Glu 


Ala 


i\xa 




310 






wx y 


Leu 


Phe 




"3 O C 

o b 








Glu 


Phe 


Val 


Val 


Arg 


Ala 


Thr 


Leu 








360 


Gin 


Gin 


Glu 


Tyr 






375 




Leu 


Leu 


Lys 


Asp 




390 







75 

He Ala Ala Asp 
90 

Thr Thr Trp Leu 
105 

Thr Ser His Val 

Leu He Asp Thr 
140 

Gly Arg Gin Gin 
155 

He He Arg Ala 
170 

Lys Arg His Asp 
185 

Ala Met Ser Gly 

Arg Arg Gly Thr 
220 

Leu Ala Ser Gin 
235 

He Thr Thr Arg 
250 

Met Pro Ser Leu 
265 

He Thr Gly Asp 

Arg Leu His Pro 
300 

Leu Tyr Tyr He 
315 

Gly Pro He Gly 
330 

Gly He Arg Ser 
345 

Phe Ala Gly Ala 

Glu Glu Thr Gly 
380 

Tyr Asn His Val 
395 



80 

Lys He Thr Phe 
95 

Ala Gin Phe Val 
110 

Thr Asp Glu Val 
125 

Leu Ala He Asp 

Thr Leu Gin Leu 
160 

Leu Ala Glu Gin 
175 

Glu Leu Phe He 
190 

Gly Gin He Ala 
205 

Asp Gly Ala Asp 

Lys Asn Arg He 
240 

Leu Gin Asp Val 
255 

Leu Lys Asn Lys 
270 

He Ala Ala His 
285 

Thr Pro Ala Leu 

Ala Thr His Glu 
320 

Tyr Phe Thr Ala 
335 

Met Tyr Val Asn 
350 

Gly He Val Ala 
365 

Leu Lys Phe Glu 
Glu 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
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(vi) ORIGINAL SOURCE: 

" (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 

Thr Ser Leu Lys Val Pro Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 

Arg Ser He Thr Val Arg Val 
1 5 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
TTGCGGCACT ATTAGGTG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CGCCSATGCC AAAACGTA 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(ii) MOLECULE TYPE: 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GATCCATGAT GACATATCAC GAAACGCGCG TTTCGTGATA TGTCATCATG 



